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Abstract —Polar codes are the latest breakthrough in coding 
theory, as they are the first family of codes with explicit con¬ 
struction that provably achieve the symmetric capacity of discrete 
memoryless channels. Ankan’s polar encoder and successive can¬ 
cellation decoder have complexities of TV log TV, for code length 
TV. Although, the complexity bound of TV log TV is asymptotically 
favorable, we report in this work methods to further reduce the 
encoding and decoding complexities of polar coding. The crux is to 
relax the polarization of certain bit-channels without performance 
degradation. We consider schemes for relaxing the polarization of 
both very good and very bad bit-channels, in the process of channel 
polarization. Relaxed polar codes are proved to preserve the ca¬ 
pacity achieving property of polar codes. Analytical bounds on the 
asymptotic and finite-length complexity reduction attainable by 
relaxed polarization are derived. For binary erasure channels, we 
show that the computation complexity can be reduced by a factor 
of 6, while preserving the rate and error performance. We also 
show that relaxed polar codes can be decoded with significantly 
reduced latency. For AWGN channels with medium code lengths, 
we show that relaxed polar codes can have lower error probabili¬ 
ties than conventional polar codes, while having reduced encoding 
and decoding computation complexities. 

I. Introduction 

Polar codes, introduced by Arikan G), 0, are the most 
recent breakthrough in coding theory. Polar codes are the first 
and, currently, the only family of codes with explicit construc¬ 
tion (no ensemble to pick from) to asymptotically achieve the 
capacity of symmetric discrete memoryless channels as the 
block length goes to infinity. Besides their obvious application 
in error correction, recent research have shown the possibility of 
applying polar codes and the polarization phenomenon in var¬ 
ious communications and signal processing problems such as 
data compression J4), [5], BICM channels JS), wiretap channels 
0, multiple access channels HD-ED, and broadcast channels 
DU. There have also been various modified constructions of 
polar codes for the different applications, such as generalized 
polar codes Ifl2ll . compound polar codes fl~3l , concatenated 
polar codes |[T4l . and universal polar codes [15[ . 

Polar codes can be encoded and decoded with relatively 
low complexity. Both the encoding complexity and the suc¬ 
cessive cancellation (SC) decoding complexity of polar codes 
are O (TV log TV), for code length TV (2). The decoding latency 
and memory requirements of polar decoders can be reduced to 
O(N) I fl6l - fll 8| . Hardware architectures for polar decoders, 
with O(N) memory and processing elements, were imple¬ 
mented COD. A semi-parallel architecture for SC decoding 
has been recently proposed |17], where efficiency is achieved 
without a significant throughput penalty by sharing processing 
resources and taking advantage of the regular structure of polar 
codes. The encoding and decoding latencies of polar codes 
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can also be reduced to O(N), through multi-dimensional polar 
transformations ED. Alamdar and Kschischang proposed a 
simplified successive cancellation decoder with reduced latency 
and computational complexity by simplifying the decoder to 
decode all bits in a rate-one or a rate-zero constituent code 
simultaneously [T9). Reduction in decoding latency can also be 
achieved by changing the code construction, such as through 
interleaved concatenation of shorter polar codes m. 

In this paper, we propose methods to reduce both the encod¬ 
ing and decoding computational complexities of polar codes, by 
means of relaxing the channel polarization. The resultant codes 
are called relaxed polar codes. Hence, hardware implementa¬ 
tions for the encoders and decoders of relaxed polar codes can 
require smaller area and less power consumptions than conven¬ 
tional polar codes. Efficient methods for the implementation of 
the SC decoder, as in lfl6l — fTHl . can also be applied to further 
improve the efficiency of decoding relaxed polar codes. 

In practical scenarios, codes have finite block lengths and 
are designed with a specific information block length and rate 
in order to satisfy a certain error rate. Due to the nature of 
channel polarization, the error probability of certain bit chan¬ 
nels decrease (or increase) exponentially at each polarization 
step. Hence, the encoding and decoding complexities can be 
reduced by relaxing the polarization of certain channels if their 
polarization degrees hit suitable thresholds, while satisfying the 
code rate and error rate requirements. For Arikan’s polar code 
with length TV, each bit-channel is polarized log TV times. How¬ 
ever, for the proposed relaxed polar codes, some bit channels 
will be fully polarized log TV times, and the polarization of the 
remaining bit-channels will be relaxed, where their polarization 
is aborted if they become sufficiently good or sufficiently bad 
with less than log TV polarization steps. Relaxed polarization 
results in fewer polarizing operations, and hence a reduction in 
complexity. It is found that with careful construction of relaxed 
polar codes, there is no error performance degradation. In fact, 
it is observed that relaxed polar codes can have a lower error 
rate than conventional polar codes with the same rate. 

The rest of this paper is organized as follows. In Sec- 
tion|IIl we give an overview of channel polarization theory 
and construction of conventional polar codes, which we call 
fully polarized codes. In Section|III] the notion of relaxed 
channel polarization is introduced and the relaxed channel 
polarization theory is established. The asymptotic bounds on 
the complexity reduction using relaxed polar codes are dis¬ 
cussed in SectionlTVl Then, upper bounds and lower bounds 
on the complexity reduction at finite block lengths are derived 
in Section[V] These bounds are evaluated and compared with 
the actual complexity reductions at certain code parameters, in 
Section [V-Dl Constructions of relaxed polar codes for general 
channels, and decoders for relaxed polar codes are discussed 
in Section[VT] The relation between the relaxed polar code 
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construction and the simplified successive cancellation decoder 
(SSCD) is discussed in Section IVI-CI Numerical simulations 
on the AWGN channels are presented in Section [VI-DI The 
paper is concluded in Section lVTII 


II. Arikan’s Fully Polarized Codes 


For any binary-input discrete memoryless channel (B-DMC) 
W : 3F —> £F, let W{y\x) denote the probability of receiving 
y given that x G SF = {0,1} was sent, for any x £ 3F and 
j gf'. For an B-DMC IF, the Bhattacharyyaparameter of IF 
is 

Z(W) d ^ f Y, VW(y\0)W(y\l). (1) 

y<Z& 


The symmetric capacity of a B-DMC W can be written as 


i(w) = Y l E w(y\x)io gJ 


W{y\x) 


x^SC 


W(y |0) 


W(y\l) 


( 2 ) 


For a binary memoryless symmetric (BMS) channel with 
uniform input, the error probability of W can be characterized 
as 

E(W) = - £ min{W(t/|0), W(y |1)}. (3) 

yfZW 

The Bhattacharyya parameter Z(W) can be shown to be always 
between 0 and 1. The Bhattacharyya parameter can be regarded 
as a measure of the reliability of W. Channels with Z (IF) close 
to zero are almost noiseless, while channels with Z(W ) close 
to one are almost pure-noise channels. More precisely, it can 
be proved that the probability of error of an BMS channel is 
upper-bounded by its Bhattacharyya parameter |[20ll 


0 < 2 E(W) < Z(W) < 1. (4) 


The construction of polar codes is based on a phenomenon 
called channel polarization discovered by Arikan j2j. Consider 
the polarization matrix 


Consider the 2x2 polarizing transformation F which takes 
two independent copies of W and performs the mapping 
(IF, W) (W~,W+), where W : {0,1} ->• *3f, W~ : 
{0,1} —> ?F 2 , and IF + : {0,1} -A {0,1} x ^ 2 , then 
polarization is defined with the channel transformation 

W~(y 1 ,y 2 \x 1 ) = - Y w (yi\ x iOx 2 )W(y 2 \x 2 ), (6) 

ai2G{0,l} 

W + (y 1 ,y 2 ,Xi\x 2 ) = ^W{yi\x\ © x 2 )W(y 2 \x 2 ), (7) 

where W~ and IF + are degraded and upgraded channels 
respectively. Hence, the following is true for the bit-channel 
rates (2), 

I{W + ) + I{W~) = 2/(IF) (8) 

I(W~) < I(W) < I(W+). (9) 

The mapping of (IT} W) —A (IF - , 1F + ) is called one level 
of polarization. The same mapping is applied to IF - and W + 
to get IF , IF l ", IF -1 , IF ++ , which is the second level 


of polarization of IF. The same process can be continued in 
order to polarize IF for any arbitrary number of levels. The 
polarization process can be also described using a binary tree, 
where the root of the tree is associated with the channel IF. 
Each node in the binary tree is associated with some bit-channel 
IF' and has two children, where the left child corresponds to 
IF ~ and the right child corresponds to IF + . 

The channel polarization process can be also represented 
using the Kronecker powers of F defined as follows. F® 1 = F 
and for any i > 1, 


F®(i- 1 ) 0 

F®(i- 1 ) 


where F® 1 is a 2* x 2* matrix. Let n = log 2 N. Then, the N x N 
polarization transform matrix is defined as Gn = /TyF®'"', 
where Bn is the bit-reversal permutation matrix 10. Let u^ 
denote the vector (ui,u 2 , ■ • •, itjv) of N independent and uni¬ 
form binary random variables. Let be transmit¬ 

ted through N independent copies of a binary-input discrete 
memoryless channel (B-DMC) IF to form channel output . 
Let W N : 3F N -A ?F ,V denote the channel that results from 
N independent copies of IF in the polar transformation i.e. 
W N (y^\xi) = nti W( Vi \ Xi ). The combined channel W N 
is defined with transition probabilities given by 

W N (y?K) 4 W N (y? KGjv) = W N (y? |uf B N F® n ). 

( 10 ) 

This is the channel that the random vector observes through 
the polar transformation. 

Assuming uniform channel input and a genie-aided succes¬ 
sive cancellation decoder, the bit-channel IF^ is defined with 
the following transition probability: 

Y W n{vi\ui)- 

(0 (11 > 

Notice that IF}, gives the transition probabilities of Ui as¬ 
suming all the preceding bits u'} 1 are already decoded and 
are available, together with the N observations at the channel 
output y± . This is actually the channel that Ui observes and is 
also referred to as the ?'-th bit-channel. It can be observed that 
IFy corresponds to the i-node in the n- th level of polarization 
of IF. The following recursive formulas hold for Bhattacharyya 
parameters of individual bit-channels in the polar transforma¬ 
tion J2J 

Z{W^- X) ) < 2Z{W®) - Z(wtf) 2 ( 12 ) 

Z(W^) = Z(Wf) 2 (13) 


with equality iff IF is a binary erasure channel. 

The channel polarization theorem states that as the code 
length N goes to infinity, the bit-channels become polarized, 
meaning that they either become noise-free or very noisy. 
Define the set of good bit-channels according to the channel 
W and a positive constant 13 < y 2 as 

Qn(W,P) d = {* e [N] : Z(W ( ; ] ) < 2~ nP /N } , (14) 
where [N] = {1, 2,.... N }, then the main channel polariza- 
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tion theorem follows 0, 0: 

Theorem 1: For any BMS channel W, with capacity C(W), 
and any constant f3 < y 2 . 


lim 

N—foo 


\Gn(W,(3)\ 

N 


C(W). 


TheoremQ]readily leads to a construction of capacity-achieving 
polar codes. The crux of polar codes is to carry the information 
bits on the upgraded noise-free channels and freeze the de¬ 
graded noisy channels to a predetermined value, e.g. zero. The 
following theorem shows the error exponent under successive 
cancellation decoding (2): 

Theorem 2: Let IT be a BMS channel and let k = 
\Gn{W, 0) | be the cardinality of the information bits, which are 
encoded using a polar code of length N, and transmitted over 
W N , then the probability of decoder error under successive 
cancellation decoding satisfies P e < 'Y2 iG g N ^ w ^Z(W^) < 
2 ~ nP . 

Similar to the set of good bit-channels, the set of bad bit- 
channels is defined according to the channel W and a positive 
constant (3 < V 2 as 


Bn{W, (3) = f { * G [N] : Z(W$) > 1 - 2~ nP } , (15) 


The following corollary can be derived by specializing the 
Theorem 3 of EQ: 

Corollary 3: Let W be an arbitrary BSM channel. Then, for 
any positive constant j3 < 1 / 2 , 


lim 

N—f oo 


\Bn{W,P)\ 

N 


1 -C{W). 


(16) 


III. Relaxed Polarization Theory 

In this section, we define relaxed polarization. We prove that, 
similar to conventional polar codes, relaxed polar codes can 
asymptotically achieve the capacity of a binary memoryless 
symmetric channel. We also prove that the bit-channel error 
probability of relaxed polar codes is at most that of conventional 
polar codes without rate-loss. 

Let us observe the definition of good channels in TheoremQ] 
Let us also observe that the Bhattacharrya parameters (BP) 
approach 0 or 1 exponentially with the block length N. Let 
W denote the bit-channels of the relaxed polar code. Consider 
two independent copies of a parent bit-channel i at polarization 
level t to be polarized into two bit-channel children at level 
t + 1, corresponding to a code of length 2 t+1 , via the following 
channel transformation 

(W $, W*?) -A (wgl ^, W §&) . (17) 

— ’(i) 

Consider the case, when the polarized channel Wf at level 
t < n, where n = log N, is sufficiently good, such that it 
satisfies the definition of a good channel at the target length 2 n , 
i.e. Z(W 2 P) < 2~ N? /N. Then, the idea of relaxed polarization 
is to stop further polarization of this good channel, and the 
corresponding node in the polarization tree is called a relaxed 
node, such that the channels of all the descendents of a relaxed 
node are the same as that of the relaxed parent node and 
will also be relaxed. Let w[ 0 and u\ e denote the sub-vectors 


with odd and even indices, respectively. Then, the bit-channel 
transformation at the relaxed node is given by 


ur( 2i_1 ) 
vv 2 t + 1 


/ 2* 
(2/i 




\u2i-l) = 

_1 |««) = 


w£ ] {y? ,ullo 2 \u 2 i -i) 

W£(y% + + \,u?- 2 \u 2i ). 


Relaxing the further polarization of sufficiently good chan¬ 
nels is called good-channel relaxed polarization. For the good- 
channel relaxed polar code, define the set of good bit-channels 
according to the channel W and a positive constant (3 < y 2 as 

Qn(W,P) = {i£ [N] : Z(W f) < 2-^7 N}. (18) 

Next, we show that relaxed polar codes, similar to fully po¬ 
larized codes, asymptotically achieve the capacity of BMS 
channels. 


Theorem 4: For any BMS channel W, with capacity C(W), 
and any constant 6 < y 2 , 


lim 

N—too 


Gn(W,P) 

N 


C(W). 


Proof: Consider a relaxed channel at level t < n, where 
n = log IV. Then Z(W$) < 2~ nP /N. Then the BPs of all 
its 2 n ~ t descendents at level n are equal to ZfW^t) and are 
in Gn(W, j3). In case of full polarization, if a channel belongs 
to G(W, /3), then it must have polarized to a good channel at 
level n or earlier. If itpo lari zed at level n, then by definition it 
also belongs to Gn{W, 0). Otherwise, its parent has polarized 
to a good channel at level t < n. With relaxed polarization, this 
channel and all its 2 n ~ t — 1 siblings will also be in Gn(W, 0). 
Therefore, G(W,/3) C Gn(W,/3) and hence \Gn(W,/3)\ > 
\G(W, 0)\. Then, the proof follows from TheoremQ] ■ 

The upper bound 2 on the probability of error as in 
Theorem[2]is still valid for the relaxed polar code constructed 
with respect to Gn{W,0). Hence, Theorem|4] shows that it is 
possible to construct good-channel relaxed polar codes, which 
are still capacity achieving. 

The remaining question is to actually compare the bit-error 
rate of relaxed polar code with that of Ankan’s polar code. Con¬ 
sider the special case when G N (W , 0) = Gn{W, , (3). Consider 
the channel W^ 2 , then we have the following inequalities (the 
proof is provided in the Appendix.) 

e(w^~ 1) ) = 2E(w^ /2 )-2E(w^ /2 ) 2 (19) 

e(w% i) ) > 2 E(W§ 2 ) 2 ( 20 ) 

Consider a good-channel relaxed polar code with good-channel 
set Gn(W,(3 ), which is assumed to be equal to the good 
channel set of the fully polarized polar code, i.e., Gn(W, 0) = 
Gn(W., 0). Consider the last level of channel polarization e.g. 

channel and its children, and assum- 

(A 

ing that W^ 2 is a relaxed node. Then, both indices 2i — 1 

and 2 i are contained in Gn{W,/3) and Gn(W,/3). For the 
relaxed code, it follows that sum error probability of these two 
channels is E + E (w^) = 2£ (w$ 2 ) = 
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2 E {w^j 2 ) ■ Together with summing (fl9] > and ( [20l> . it follows 

that 2 E (w$ /2 ) < E + E (w^. Therefore, 

we have the following lemma. 

Lemma 5: Let a good-channel relaxed polar code have a 
good-channel set Gn{W, (3), which is equal to the good chan¬ 
nel set of the fully polarized polar code, i.e., Gn(W,/3) = 
Gn{W,P), then 

E e(w^) < E E ( W n)- ( 21 ) 

i£GN(W,t 3 ) i&0N(Wj3) 

Note that the left hand side of (|2TT > is the union bound on the 
frame error probability (FER) of the constructed relaxed polar 
code while the bound is very tight for low FERs. Similarly, 
the right hand side of (|2TT > is the union bound on the frame 
error probability (FER) of the constructed relaxed polar code 
which is also very tight for low FERs. Hence, we conclude that 
the relaxed polar code is expected to perform better than the 
fully polarized codes in terms of frame error rate. This will be 
verified in Section [VI-DI 

Remark 1: The concept of good-channel relaxed polariza¬ 
tion, discussed so far, can be extended to bad-channel relaxed 
polarization as follows. Consider the bit-channels in the polar¬ 
ization tree, at a level t < n, that are very bad. A bit-channel 
can be considered very bad if its Bhattacharyya parameter is 
very close to 1, or if none of its descendents will fall into the set 
of good bit-channels at the last level of polarization n. Hence, 
the polarization of very bad bit-channels can be stopped without 
affecting the final set of good bit-channels. Thus, with careful 
bad-channel relaxed polarization, more complexity reductions 
can be possible, without degrading the code rate and error 
performance. 

Remark 2: The obvious advantage of relaxed polarization 
is savings in both encoding and decoding complexities, since 
there will be no channel transformations done at relaxed nodes 
while encoding, and there will be no need to calculate new 
likelihood ratios (LRs) at relaxed nodes while decoding. Hence, 
relaxed polarization will result in a reduction in the encoding 
and decoding computational complexities of polar codes. An¬ 
other advantage of relaxed polarization is the reduction in space 
(area and memory) complexity in practical implementations. 
That is because decoding of relaxed polar codes will result 
in smaller LRs (or log-likelihood ratios (LLRs) in case the 
computation is done in the log domain mil) than that for 
fully polarized polar codes, and hence smaller bit-width will be 
required for LR calculation and storage. Relaxed polarization 
has the effect of reducing the number of processing nodes 
required at lower levels of the polarization tree, and hence 
one can expect even more efficient implementations (or less 
throughput penalty) with semi-parallel hardware architectures 
sra. Also, by appropriate permutation of the bit channels, 
one expects to be able to eliminate the wiring for the wider 
butterflies in FFT-like SC decoders for relaxed polar codes. The 
reduction in bit-width and number of processing nodes required 
for relaxed polar code decoders has the compound effect of 
reduction in power consumption. 

The reduction in encoding and decoding complexity will be 
addressed in the following section. 


IV. Asymptotic Analysis of Complexity Reduction 

In this section, we establish bounds on the asymptotic com¬ 
plexity reduction (as the code’s block length goes to infinity) 
in polar code encoders and decoders, made possible by relaxed 
polarization. 

First, we elaborate the notion of complexity reduction. For 
Ankan’s polar codes, the total number of channel polarization 
operations required using Ankan’s butterfly polarization struc¬ 
ture, is 

A(n) = nN, 


where N = 2" is the length of the code. As a result, the 
encoding procedure consists of nN binary XOR operations 
and decoding procedure consists of nN LLR combinations. 
Therefore, each skipped polarization operation is equivalent 
to one unit of complexity reduction in both encoding and 
decoding, where the unit corresponds to a binary XOR when 
encoding and an LLR combining operation when decoding. 
The complexity reduction 7 Z(n) is defined to be the ratio of 
the number of polarization operations that are skipped due 
to relaxed polarization to the total number of polarization 
operations, A(n), required for full polarization. The complexity 
reduction (CR) can be directly translated into encoding and 
decoding complexity ratios of (1 — TZin)) -1 . 

For the asymptotic analysis throughout this section, a family 
of capacity-achieving polar codes is assumed which is con¬ 
structed with respect to a fixed parameter /3 < y 2 and the set of 
good bit-channels Gn(W., f3), for any block length N = 2™. 

Theorem 6: Let C{W) be the capacity of the channel W. 
Then, for any e > 0, small enough 6 > 0, and large enough 
N, the complexity reduction ratio using the relaxed polar code, 
constructed with Gn(W, /3), is at least 

(C7-<:)(!-(2+ <5)/3) 


Proof: Pick a fixed <5 such that 0 < A < 1//3 — 2. Consider 
the polarization level [~(2 + 8)/3ri \. Let N' = 2^ 2+s ' , ^ n ^ be 
the total number of nodes at this level. Then for large enough 
n, the nodes at this level with index belonging to the set 
Gn 1 (IF, 1/(2 + 5)) will be relaxed. Notice that the fraction 
of these nodes, i.e. \Gn’{W, 1/(2 + 5))| /N', approaches the 
capacity C by TheoremQ] The fraction of bit-channel polar¬ 
izations between the level |"(2 + 8)j3n\ and the last level n is 
(1 — (2 + 8)0) of the total nN, among which a fraction of 
C — e of them are relaxed, for large enough n. Therefore, the 
complexity reduction will be at least (C — e) (l — (2 + 8)0 ). ■ 

In the next theorem, a bound on the asymptotic complexity 
reduction using the bad-channel relaxed polarization is pro¬ 
vided. The following scenario is considered for bad-channel 
relaxed polarization: if none of the descendants of a certain 
node will belong to Gn(W,0, then the polarization at that 
node, and consequently all of its descendants, will be relaxed. 

Theorem 7: For any e > 8 > 0 and large enough N = 2 n , 
the complexity reduction ratio using the bad-channel relaxed 
polarization is at least 


(1 -C 



(2 + 8) log n 


Proof: Consider the polarization level t = [(2 + 5) log n). 
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Then by Corollary^ for large enough a, the fraction of nodes 
with Bhattacharyya parameter at least 1 — 2 >1—2 " 

is at least 1 — C — e. Consider such a node V with Bhattacharyya 
parameter Z > 1 — 2~ n . Then the best descendant of V at the 
last level of polarization has Bhattacharyya parameter 

Z 2"- t > > (! _ 2 _ra ) 2 " > 

e 

which implies that it can not be a good-bit-channel. Therefore, 
the total fraction of complexity reduction is at least 


and the theorem follows. ■ 

Observe that, by neglecting e and 5 in the bounds given in 
Theorem[6]and Theorem[7]and by assuming large enough n, the 
complexity reduction ratio from good-channel and bad-channel 
relaxed polarization is C(1 — 2/3) and 1 — C, respectively, 
which are both positive constant factors. By combining both 
good and bad channel relaxation, the ratio of saved operations 
approaches 1 — 2/3C. Hence, relaxed polarization can provide a 
non-vanishing scalar reduction in complexity, even as the code 
length grows infinitely. 

V. Finite Length Analysis of Complexity 
Reduction 

In this section, we derive bounds on the complexity reduction 
resulted from good-channel and bad-channel relaxed polariza¬ 
tion at finite block lengths. 


A. Relaxed polar code constructions using Bhattacharyya pa¬ 
rameters 

In general, finite block length polar codes are constructed 
by fixing either a target frame error probability (FER) E or 
target code rate R. We consider construction of polar codes 
with code length N = 2 n , at a target FER of E. To simplify 
notation, let Z^t = Z(W,jt > ). At finite block lengths, we need 
to specify certain thresholds for Bhattacharyya parameters in 
order to establish criteria for good-channel and bad-channel 
relaxed polarization. As a result, the following scenarios are 
considered for relaxed polarization: 

1) Good-Channel Relaxed Polarization (GC-RP): 

Node i at polarization level t is not further polarized if 

Zi, t < Tg 

2) Bad-Channel Relaxed Polarization (BC-RP): 

Node i at polarization level t is not further polarized if 

Zj, t t > % 

3) All-Channel Relaxed Polarization (AC-RP): 

Node i at polarization level t is not further polarized if 

Z i>t < T g or Z i>t > Tb 

where T g and % are thresholds that can be considered as 
parameters of the construction. 

Remark 3: If T g = 2 E/N, then GC-RP constructed codes 
satisfy the target FER E. Let F be the set of good bit-channel 
channels which are used to transmit the information bits. Then, 
this can be observed by Q and the fact that |T| < N, which 


Relaxed Polarization Complexity Reduction, Target FER=0.01 



Fig. 1. Complexity reduction via relaxed polarization on erasure channels at 
n = 20 


gives the following 

FER < z t,n /2 < |r| E/N < E. (22) 

ter 

In the proposed bad-channel relaxed polarization (BC-RP), the 
bad channels are not further polarized if they become suffi¬ 
ciently bad, where the bad-channel relaxation threshold can be 
set to be % = 1 Tg. To guarantee no rate loss from BC- 
RP, it should only be performed if n — t < [Tog 2 |°| 2 ^ ]. The 
proposed all-channel relaxed polarization (AC-RP) relaxes the 
polarization of a bit-channel if it becomes either sufficiently 
good or bad. Since the bad bit-channels do not contribute to 
the FER, the target FER is still maintained with AC-RP. 

In Fig. [I] the achieved complexity reduction ratio for a binary 
erasure channel with erasure probability p, BEC(p), is shown. It 
is observed that up to 85% CR is achievable, i.e. fully polarized 
(FP) code requires 6.6-fold the complexity of RP code. AC- 
RP will result in more complexity reduction than GC-RP as the 
channel becomes worse. The rate-loss is calculated as 

Rloss = Rfp — Rrp, (23) 

where Rfp and Rpp are the rates of the codes which are con¬ 
structed by full and relaxed polarization, respectively. The rate 
at a target FER E is calculated by aggregating the maximum 
number of bit-channels such that their accumulated BPs does 
not exceed 2 E. In this simulation result shown in Fig. [T) the rate 
loss is always less than 10” 4 . Another important observation, 
from Fig. Q] is the symmetry of the CR curve around p = 0.5. 
This is explained by the following Theorem[8j which is a direct 
result of Lemma[9]and the description of GC-RP and BC-RP. 

Theorem 8: The complexity reduction with bad-channel re¬ 
laxed polarization of BEC(p) with threshold 1 — T is the same 
as that of good-channel relaxed polarization of BEC(1 —p) with 
threshold T. 

Lemma 9: Let the nodes in the polarization tree be labeled 
by their BPs. Then, the polarization tree for BEC(p) and the 
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Fig. 2. Upper bound (UB) on Complexity Reduction. 


Fig. 3. Lower bound (LB1) on Complexity Reduction. 


polarization tree for BEC(1 — p) are isomorphic, where a node 
V with BP Z in the first tree is isomorphic to a node with BP 
1 — Z in the second tree. 

Proof: We show that the one-to-one mapping is nothing 
but mirroring i.e. the i-th node at the polarization level t will 
be mapped to the node indexed by 2 1 — i at the same level. 
It is sufficient to show this for one polarization level and then 
the rest follows from induction. Let W\ — BEC(p) and Wf = 
BEC(1 — p). Then 

Z(W+) = (1 -p) 2 = 1 - (2 p-p 2 ) = 1 - Z(Wf) 

And 

Z(W 2 ~) =2(l-p)~(l-p) 2 = l-p 2 = l- Z(W+) 

Therefore, by induction on the polarization level, it is shown 
that each polarized node V in the polarization tree of W\ can be 
mapped to the polarization tree of W -2 by reversing the sequence 
of +’s and —’s during its polarization. Furthermore, the BP Z 
of V will be mapped to BP 1 — Z at the image of V. ■ 


B. Analysis of complexity reduction for GC-RP 

In this subsection, bounds on the complexity reduction from 
good-channel relaxed polarization are discussed. Let T = 77,• 
In the next theorem, a simple upper bound is provided, which 
is also illustrated in Fig. [2] 

Theorem 10: The good-channel relaxed polarization com¬ 
plexity reduction is upper bounded by 


Tlg(n) <{n-t g )/n, (24) 


where t g 
ter. 



log 2 T 
l°g 2 P 


and p is the erasure channel parame- 


Proof: The upper bound follows by considering the mini¬ 
mum number of polarization levels required for the best polar¬ 
ized channel to reach the threshold 77 Notice that Z\ (J = p. 
Then, after t polarization levels, the minimum BP among all 
Z it is indeed Zpt = p 2 . Hence, t g polarization levels are 
required for the BP of at least one bit channel to be less than 77 
The upper bound on saved operations follows by skipping all 
polarization steps at all remaining n — t g levels. ■ 

Next, we derive lower bounds on the complexity reduction 
with relaxed polarization for BEC(p). For any polarization 


level t and some threshold B, let Gi:.t denote the set of 
bit channels at polarization level t with BP at most B i.e. 

Gs y t = {i '■ Zi,t < B}. 


Theorem 11: Let t b = 


, for an arbitrary thresh- 


log 9 1 ,° S2 B 
b2 log 2 P 

old B. For a polarization level t 1 > t b , let 7 = 1 ^ 1 / 2 ^. 
Then the good-channel relaxed polarization complexity reduc¬ 
tion is lower bounded by 

lZ g (ri) > 72 ~ tr (n — t r — tf)/n 


Inp- iHIaX 

10 S2 log 2 B 


where t r = 

Proof: Notice that t 1 > t b guarantees that Gs,t is a non¬ 
empty set. In polarization level t r + / 7 , any node in Gis.t-, has 
at least one descendant with BP less than 77 i.e. the right-most 
descendant which has BP at most B 2 r <77 Therefore, there 
are at least -y2G = 1 nodes at polarization level t r + f 7 

that have BP less than 77 and will be relaxed. Relaxing each of 
these nodes is equivalent to skipping S = (n—f r —f 7 ) 2 n ~ tr ~G 
polarization steps. Then the total number of polarization steps 
skipped is 72 ^5', and the proof follows. 


Corollary 12: Let t g = 




Then the good- 


channel relaxed polarization complexity reduction is lower 
bounded by 

B-gin) > 2 ~ tg (n — t g )/n 

Proof: The corollary follows by taking B = p, and t 1 = 
t b = 0 in TheorempTl ■ 

The following provides a tighter lower bound on the GC-RP 
complexity reduction, which is also illustrated in Figure [4] 


Theorem 13: Let t b = 


log 2 


+ 1 , for an ar- 


log? B -1 

=2 log 2 p 

bitrary threshold B. For a polarization level t > t b , let 
7' = min t > tb | {i odd : i £ GB,t} 1/2*. Then the good-channel 
relaxed polarization complexity reduction is lower bounded by 

Y 2 _tr 

lZ g (n) > —-- ((n — t r )(n — t r — 2t + 1 ) + t(t — 1 )) 


2 n 


where t r = 


iog 2 T 


l0 §2 1 ZffB 

Proof: Consider the right-most node at polarization level 
t — 1 which has BP p 2 * 1 < p 2±b 1 < B/2. Therefore, the 
left child of this node is contained in Gs,t which means that 
the set of odd-indexed nodes in T B,t is always non-empty for 
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Fig. 4. Lower bound (LB2) on Complexity Reduction. 


t > tb- The right-most descendant of any of these nodes, after 
t r more polarization levels, will have BP less than T and will 
be relaxed by eliminating the polarizing subtrees emanating 
from them. The total reduction of polarization steps for each of 
these relaxed nodes is at least 5(f) = y'2 t 2 n ~ t ~ tr {n — t — t r ) 
polarization steps. The bound follows by summation of 5(f) for 
all t with tb < t < n — t r . Notice that since the right-polarized 
children of those odd-indexed nodes at t are even indexed, they 
are not counted among the odd-indexed nodes in Qn.t', for any 
other t'. ■ 

Notice that in the above lower bound, a necessary condition 
is that t + 2t r > n to guarantee that no double counting occurs. 
In many practical operation scenarios, this condition holds. If 
the condition does not hold, one can modify the bound by 
limiting the computed summation to Y^to=t S'(fo)- 

Remark 4: In TheoremfTTl for large enough f 7 , the parame¬ 
ter 7 is independent of B and is approximately equal to C, the 
capacity of the underlying channel W. Also, 7 ' in TheoremfTTl 
is approximately C/2. For our practical applications, we can 
always pick a proper value of B such that these approximations 
still hold at the desired block length. 


C. Analysis of complexity reduction for AC-RP 

In this subsection, we analyze the complexity reduction from 
bad-channel relaxed polarization, as well as all-channel (both 
good and bad) relaxed polarization. As opposed to the previous 
subsection, we limit our attention in this subsection to binary 
erasure channels (BEC), wherein the exact computation of 
Bhattacharrya parameters is applicable at finite block lengths. 

Throughout this subsection, we always assume the channel 
BEC(p). For a function F(jp), let F t {p) denote the output from 
recursive application of the function F t times, with initial input 
P- 

Theorem 14: The bad-channel relaxed polarization com¬ 
plexity reduction is upper bounded by 

lZ b (n) < {n-t b )/n, (25) 

where tb = min t {F*(p) > 1 — T} and F(p) =2p — p 2 . 

Proof: The left child of a node with BP Z is associated 
with a bit-channel with BP F(Z). Hence, it requires tb polar¬ 
ization levels for the worst left-polarized bit-channel to have 


BP greater than 1 — T. The rest of the proof follows as for the 
GC-RP case. ■ 

Theorem fl4l can also be proved by combining the results of 
Theorem[ 8 ]and Theorem [TOl The bounds derived for the good- 
channel relaxed polarization in the previous subsection, can 
be turned into bounds for bad-channel relaxed polarization of 
BEC(p) by replacing p with 1 — p i n the bounds, and modifying 
other parameters accordingly. Hence, to avoid writing similar 
proofs, we only mention the theorems and skip the proofs. 


Let t g = 


tb = 


log 2 


log 2 T 
2 log 2 P 
log? T 


log 2 


and tb as in TheoremfTTl In fact. 


. Observe that if p > 0.5, then tb < t g . 


=2 log 2 1 -p 

if p < 0.5, then tb > t g , and if p = 0.5, then tb = t g . Com¬ 
bining TheorcmITTI with upper-bounds on GC-RP of Theorem 
fTO]results in the following upper bound on AC-RP complexity 
reduction. 

Corollary 15: The all-channel relaxed polarization com¬ 
plexity reduction is upper bounded by 


lZ a (n) <(n- t')/n, 


(26) 


where t' = t g for p < 0.5 and t' = tb forp > 0.5. 

The next theorem can be also regarded as the counterpart of 
TheoremfTTl for bad-channel relaxed polarization. 

log? G 


Theorem 16: Let t r = 


lOg; 


2 log 2 (l-p) 


for an arbitrary 


threshold G. For a polarization level tp > t c , let /? = |{i : 
Zi.tp > 1 — G} |/ 2 G. Then, the bad-channel relaxed polariza¬ 
tion complexity reduction is lower bounded by 

K g (n) > /32~ tl (n-ti- tp)/n 

where f = |"log 2 

For AC-RP, the lower bounds of TheoremQT]and Theoremfl 6 l 
can be combined as in the following corollary. It is assumed that 
the set of relaxed nodes in GC-RP and the set of relaxed nodes 
in BC-RP do not intersect. This is a valid assumption as long as 
the good-channel and bad-channel relaxation thresholds, T and 
1 — T, are far apart enough, as characterized in subsection IV-AI 
Corollary 17: The all-channel relaxed polarization is lower 
bounded by 

Tla(n) > ^ (/32~ tl (n - ti — tp) + y2~ tr (n — t r — t 7 )) . 


Theorem 18: Let t c = 


loo- ^G - 1 
1C, g2 log 2 (l-p) 


1 , for an arbi¬ 


trary threshold G. For a polarization level t > t c , let /3' = 
min t >t o | {?' odd : Z t)t > 1 — G} |/2 4 . Then, the bad-channel 
relaxed polarization complexity reduction is lower bounded by 

Tlb{n) > (( n ~ t l)i n ~ti~2t+l) + tit - 1 )) 

where f = [log 2 . 

Similar to TheoremfT3l the above theorem holds under the 
condition that t + 2f > n. Also, similar to Corollary [19] 
the following corollary holds by combining Theorem [13] and 
Theorem [T 8 l To make notations consistent, let tf denote the 
level t in Theorem[l3land t' H denote the level t in Theorem[l 8 l 

Corollary 19: The all-channel relaxed polarization is lower 
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Good-Channel Relaxed Polarization Bounds, Target-FER =1e-005 



Fig. 5. Bounds on the good-channel relaxed polarization complexity reduction 
with target FER of 10“ 5 . 


bounded by 

Ra(ji) >——— (jn — ti)(n — ti — 2 t'p + 1) + t'p — t'p'j 



D. Numerical evaluation of complexity reduction by relaxed 
polarization on erasure channels 

In this subsection, we compute the complexity reduction of 
different scenarios of relaxed polarization over binary erasure 
channels and compare them with the bounds provided in this 
section. 

The block length of the constructed relaxed polar code is 
assumed to be TV = 2 20 and the FER of E = 10 -5 is assumed 
for the code construction. The erasure probability p will be 
varying between 0 and 1. We have observed that the thresholds 
B = 2 p—p 2 and 1 —G = p 2 will result in desired values for 7 in 
Theoremim and /3 in Theorem[l6l With these values of B and 
G we have 7 = 1 — p and f3 = p. We have also observed that 7' 
in TheoremfT3land If in Theoremp~8lcan be well approximated 
by 7/2 and /3/2. The results of Fig. |5]show that actual CR of 
GC-RP can be characterized using the derived upper and lower 
bounds, and up to 70% complexity reduction is achievable at a 
target FER of 10 -5 . 

The performance of AC-RP is analyzed in Fig. [6] at the 
same target FER of 10 -5 , where the analytical bounds are 
compared to the numerical results from actual construction. 
It is observed that the bounds give a good approximation of 
the actual complexity reduction. GC-RP is effective with good 
channel parameters and BC-RP is more effective with bad 
channel parameters. The bounds are also minimized at p = 0.5, 
and symmetric around p = 0.5. This can be explained by the 
symmetry property of Theorem[8] 


VI. Relaxed Polarization on General BMS 
Channels 

In this section, we describe how a code can be constructed 
and decoded on general binary memoryless channels using 
relaxed polarization. 

A. Construction of relaxed polar codes on general BMS chan¬ 
nels 

For general binary memoryless channels, the Bhattacharyaa 
parameters are exponentially hard to compute as block length 
increases. This is due to the exponential output alphabet size 
of the polarized bit-channels. Instead of exact calculation of 
Bhattacharyya parameters, they can be well approximated by 
bounding the output alphabet size of bit-channels via chan¬ 
nel degrading and channel upgrading transformations (22) . 
The channel degrading and upgrading operations provide tight 
lower bounds and upper bounds on the corresponding Bhat¬ 
tacharyya parameters. In order to construct polar codes for 
continuous-output BMS channels (e.g. additive white Gaussian 
noise (AWGN) channels), the channel can be first quantized. 
Then, the degrading and upgrading operations will be per¬ 
formed for the bit-channels resulting from polarization of the 
quantized channel (22) . For AWGN channels, the bit channel 
error probability (BC-EP) can also be reasonably approximated 
using density evolution and a Gaussian approximation (23) ■ 
Alternatively, for short codes, the BC-EP can be numerically 
evaluated using Monte-Carlo simulations, assuming a genie- 
aided SC decoder. For generality of description, let the error 
probability of the j-th bit-channel at the /-th polarization level 
be bounded by 

E_tj < Et,j < E t j (27) 

where E_ t is the probability of error of the upgraded version of 
Wfl and Etj is the probability of error of its degraded version. 

The values of E_ t ■ and E t j can be computed using upgraded 
and degraded versions of polarization tree. In the upgraded 
polarization tree, after each level of polarization the resulting 
bit-channels will be upgraded to have a limited output alphabet 
size. At the next level, the upgraded bit-channels will be po¬ 
larized. As a result, all the bit-channels in the upgraded polar¬ 
ization tree will have a limited output alphabet size. Therefore, 
E_ t ■ can be easily computed. The same procedure is repeated 
to get a degraded polarization tree and to compute E t j. The 
construction of fully polarized codes can be done according 
to either lower bounds or upper bounds on the probability of 
error of the bit-channels at the last level of polarization. For 
instance, in case of using upper bounds, bit-channel are sorted 
according to their error probabilities E n j in ascending order. 
Accumulate as many good bit-channels in the set F, such that 
E rl) j < E, where E is the target FER. Then, the FP code 
is defined by T and has rate R = |T|/iV. 

In proposed good-channel relaxed polar codes, a node will 
not be further polarized if the upper bound on its bit-channel 
error probability is lower than a certain threshold E g . For bad- 
channel relaxed polar codes, a bit-channel will not be further 
polarized if the lower bound on its error probability exceeds 
an upper threshold <%. Numerically, it was found for BMS 





































































9 


channels that the error performance of the constructed code is 

closer to that of the upper-bound calculated using the degraded 

channel. Hence, when polarization is relaxed for a node, the 

error probability of the children of a non-polarized node is set to 

the degraded-channel error probability of the parent. As a result, 

the procedure for designing relaxed polar codes of length N at 

a target FER E on general BMS channels is specified below. 

Each node j at level t in the polarization tree is associated with 

a label Relaxedf/, j) which is initialized to 0, and will be set to 

1 only if this node will not be polarized. The error probability 

(EP) of each node in the RP tree is initialized to that of the 
__ 

fully polarized tree E t g = E tg . The relaxed polar code will be 
defined by its good channel set T^. 


Algorithm 1 Relaxed Construction for General BMS Channels 
1 : Stage 1: Calculate AC-RP bit-channel EP for target FER E 
and rate R 

2 : Set the GC relaxation threshold as £ g = E/ ( Rl ) 

3 : Set the BC relaxation threshold as £i, = 

7T- 1 (1 - H(£ g )) 

4 : for t = 1 : n do 
5 : for j = 1 : 2* do 

6 : if Relaxed)/ — 1, |~j/2~|) = 1 then 

7: Relaxed)/, j) = 1, Efj = E? Aj/2 f| 

8: else if {E t _j < £ g } or {E_t,j > £fc} then 

9: Relaxed)/, j) = 1 

10 : end if 

11 : end for 

12 : end for 

13 : Stage 2: Construct AC-RP code with rate R 

14 : Sort bit-channel EPs E t g in ascending order 
15 : Select T# to have the RN bit channels with the 
smallest EP 


For the case of AWGN channel, first the channel parameter is 
calculated to satisfy the condition C > R, where R is the target 
rate and C is the capacity of the channel. Then, the channel 
is quantized using the method of f 22 l to get a channel with 
discrete output alphabet. Then Algorithm 1, discussed above, 
will be applied to this channel. 

With target FER E, the good-channel relaxation threshold 
is chosen to be £ g = E/(RN) to satisfy the target FER, 
i.e. JA g r H E n j < E. Let H(Ew ) be the entropy of the 
channel W with fidelty E\y, such that H(Ew) = 1 — I(W), 
where I(W) is capacity of channel W. Then, the bad-channel 
relaxation threshold is chosen such that H(£b) = 1 — // {£ g ). 
For general BMS channels W with error probability TV, the 
approximation H(W) ~ h- 2 (Ew) can be used, based on 
the inequality H(W) < h, 2 (Ew) G), l(24l . where h, 2 (p) = 
— p\og 2 {p) — (1 — p) log 2 (l — p) is the binary entropy func¬ 
tion. To guarantee that there is no rate loss from bad-channel 
relaxation if > £ b } is satisfied at Step 13, then relaxation 
may only be done after verifying that the lower bound on the 
error probability of the best upgraded descendent channel of 
that node is still higher than £ g , which will be satisfied for 
practical frame errors E. 

The same procedure above can be used to construct GC- 


All-Channel Relaxed Polarization Bounds, Target-FER =1e-005 



Fig. 6. Bounds on the all-channel relaxed polarization complexity reduction 
with target FER of 10“ 5 . 


RP codes, by neglecting the bad-channel relaxation condition 
{E t j > £ b } in step 13. 

In case of erasure channels with erasure probability p , the 
channel parameter (erasure probability) for the target channel 
capacity is p = 1 — C. The upper and lower bound on the 
bit-channel error probabilities coincide, and can be calculated 
exactly by the BPs, E t j = Z t j/2. To calculate the relaxation 
thresholds, the error probability is E = p/2, and the entropy is 
H{E) = 2 E. 

Algorithm 1 constructs the relaxed polar codes for general 
BMS channels, using bounds on the bit-channel error proba¬ 
bility. However, for short block lengths, the bit-channel error 
probability can be numerically calculated to be E t j using 
Monte-Carlo simulations, assuming a genie-aided successive 
cancellation decoder. In such a case. Algorithm 1 is modified 
by letting E tJ = E t ,j = E t j. 

B. Decoding of relaxed polar codes on general BMS channels 

Decoding of relaxed polar codes can be done by a modified 
successive cancellation decoder. For a polar code of length N 
and BMS channel W, suppose that is the input vector and 
yf is the received word. 

Consider a relaxed polar code constructed as explained in 
the previous subsection. At each level t = log N', for 1 < 
i < N, Relaxed(t, i) = 1 means that is not polarized 
and Relaxed)/;, i) = 0 means that wi^l is fully polarized. In 
practical application of relaxed polar codes, the decoder will 
have prior knowledge of the polarization map by Relaxed)/, /), 
which requires at most storage of 2 N bits. For communication 
systems, the polarization map can be specified by the communi¬ 
cation standard, similar to the specification of the parity-check 
matrices of block codes. 

For i = 1, 2,..., N, the decoder computes the likelihood ra¬ 
tio (LR) L$ of it*, given the channel outputs y^ and previously 
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decoded bits u\ 


2-1 


L$(y?,u\- 1 ) = 




4=0) 


For FP polar codes, Ankan observed that calculation of 
the LRs at length N require another N calculations at the 
parent node at length (N/ 2), where the LRs from the pair 
(are assembled from 

-0) U. N / 2 n2i-2 tr, f,2i-2\ T (*) („,N 


-2i—2 


the pair u{] e ' z ®u{] 0 2 ), L% /2 {y* N/2+1 , u l e 

via a straightforward calculation using the bit-channel recur¬ 
sion formulas for n > 1 ED- The relaxed successive can¬ 
cellation decoder (RSCD) follows the same recursion. Hence, 
if Relaxed)/., i) = 0, the likelihood ratio (LR) can be 
computed recursively as follows. 


L 


N 




(28) 


1 + L% 2 {y^\uf- 2 © Ko 2 )^/M /2+ x , *£*_) 

) 




-(i) (.N/2 

J N/2\ 

,,N s.2i—l\ 


T y> u, 1 ' 1 r* tfi-2 t , r« /..n 

^*.rio_\y 1 i u l,e ““l,o ) ' Ll N/2\y 


f, 2 i - 2 

JV/2+1’ u l,e 


'kjV/2'.i’l ’ “l,e ® “ 1,0 ) 


(29) 


1 — 2lL2i — 1 


^N/2 (yN/2+1’ u 


2i—2\ 


Otherwise, if Relaxed)/:, i) = 1 the decoding equations are 
modified as follows: 

iST'V.af- 2 ) = L% 2 (y”l\u*f) (30) 

’(j/f.«?■*> = 4 > /2 (k"/!+i. (3D 

The hard-decision estimates at a parent node are calculated 
from the hard-decision estimates of its two children in a step 
similar to encoding. At the last stage when N = 1, the LRs 
are simply L^lyA = WiyAfy/WlyM.). At the end, hard 
decisions are made on L N (at the leaf nodes), except for frozen 
bit-channels where Ui = Ui = 0. 

From the above description, 2 N LR calculations for log N 
levels are required for decoding conventional FP polar codes. 
However, the decoding complexity of relaxed polar codes is 
linearly reduced by the ratio of relaxed nodes in the polarization 
tree, since no LR calculation is required at relaxed nodes. 

The relaxed successive cancellation decoding as discussed 
above is extended to perform the relaxed successive cancella¬ 
tion list (SCL) decoding of RP codes. The SCL decoding of 
polar codes is shown to have considerable improvement over 
the regular SC decoding l25l - lf27l . In SCL decoder, instead of 
only one path, i.e. a sequence of decoded information bits, up 
to L decoding paths are considered at each decoding stage. The 
decoding paths are being updated as the decoder evolves. At 
each stage of the decoding process, where an information bit m 
is being decoded, both options of Ui = 0 and Ui = 1 are being 
considered and hence, the number of decoding paths is doubled 
to at most 2 L. Then this extended list of size up to 2 L is pruned, 
based on a maximum likelihood metric, to get a list of size L of 
the locally most likely paths. In the SCL decoding, there are up 
to L likelihood ratios of l/fi 1 at each node in the decoding trellis 
and then up to L parallel recursive calculations, as in (l28l ) and 
(l29l ). are performed at the node. In the relaxed SCL decoding. 


if a node is relaxed, then there will be L parallel decoding 
equations as in (l30l > and (|3T1 >. The operations for picking the 
most likely paths remain the same for relaxed SCL. 


C. Relaxed polarization versus simplified successive cancella¬ 
tion decoding 

Whereas relaxed polarization results in a construction of a 
code different from that obtained by Artkan’s full polarization, 
the SSCD lfl9l is a simplified decoder for a specific code. In 
fact, as would be clarified below, the SSCD can also be used to 
further reduce the complexity and latency of decoding relaxed 
polar codes. 

By construction, relaxed polarization attempts to maximize 
the number of rate-one nodes and rate-zero nodes by relaxing 
the polarization of sufficiently good and sufficiently bad bit- 
channels, respectively. Rate-1 and rate-0 nodes are nodes which 
have all their descendants in the good channel set and the bad- 
channel set, respectively. As was clarified in Section IVI-BI no 
encoding or decoding operations are done at the relaxed nodes. 

The SSCD identifies the rate-1 and rate-0 nodes in the 
received code, and reduces the operations required to decode 
the corresponding constituent codes. Hence, SSCD does not 
offer complexity reductions at the encoder. Since a rate-0 node 
only has frozen bits at its output, its constituent tree does not 
need to be traversed when decoding since the leaf values are 
known a priori. The output bits of the tree rooted at a rate-1 node 
can be found by simple hard-decisions on the soft likelihood 
ratios at the rate-1 node. However, since these bit-channels were 
polarized at the encoder, the input bits at the rate-1 node need 
to be recovered with a step similar to re-encoding in order to 
recover the information bits. 

Hence, relaxed polarization offers complexity and latency 
reductions at both the encoder and the decoder, while SSCD 
only reduces the decoding complexity and latency compared 
to Artkan’s successive cancellation decoder. The decoding 
complexity reduction is calculated for the SSCD as for the 
relaxed code, where rate-1 nodes and rate-0 nodes contribute 
to the complexity reduction same as relaxed nodes, and the re¬ 
encoding complexity at the rate-1 nodes is neglected. 

Next, we compare the reductions in decoding latency. As 
described in the previous section, a polarized node requires 
three clock cycles to calculate the even and odd LRs, and 
then calculate its hard-decision estimate from the hard-decision 
estimates of its children pair using the encoding operation. 
Hence, the total decoding latency with successive cancellation 
decoding for a polar code of length N = 2 n can be assumed 
to be C(n) = 3 Y^i=o 2* = 3A^ — 3. Consider the RP 
code decoded with the RSCD. A BC relaxed node requires no 
operation and hence contributes nothing to the latency. A sub¬ 
tree of GC relaxed nodes requires only one clock cycle at its 
root to calculate its hard-decision estimates. Similarly, for the 
SSCD, rate-0 nodes contribute nothing to the latency, and a sub¬ 
tree of rate-1 nodes requires one clock cycle. However, since 
this rate-1 constituent code is fully polarized, if the root of the 
rate-1 constituent code is at level n — t, then additional t clock 
cycles are required for re-encoding to recover the information 
bits at the leaf nodes. 
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To this end, there are two important observations to make. 

Firstly, the SSCD can be combined with RSCD to decode RP 
codes. The SSCD as proposed in |fl9l is applied on Ankan’s 
fully polarized code, and will be noted as SSCD FR Since the 
relaxed polar construction as described above relaxes the nodes 
before determining the good-channel set, then there can exist 
rate-1 and rate-0 nodes in the resultant code which have not 
been relaxed. Then, this implies that SSCD can also be used to 
decode relaxed polar codes, where after determining the good- 
channel set, the rate-1 and rate-0 nodes are identified and the 
operations at rate-1 nodes and rate-0 nodes which have not be 
relaxed by construction will be simplified as in SSCD. Hence, 
combined SSCD and RSCD on RP codes, denoted by SSCD 
RP, will further reduce the decoding complexity of RP codes. 
Moreover, since RP codes are constructed to have more rate- 
1 and rate-0 nodes by the relaxation operation, SSCD RP will 
often have reduced decoding complexity compared to SSCD 
FP. 

Secondly, the relaxed polar code construction can be mod¬ 
ified such that all rate-1 and rate-0 nodes of the fully polar¬ 
ized code are relaxed at the encoder. This modified relaxed 
polarization (MRP) construction is done by first constructing 
Ankan’s fully polarized code, selecting the good-channel set 
according to the desired rate or target error probability, finding 
the modified relaxed polarization map as that which relaxes all 
the rate-1 and rate-0 nodes in the FP code, and then encoding 
according to the modified relaxed polarization map. The good 
channel set for the MRP code will be fixed as that in the FP 
code. If only the rate-1 nodes are relaxed, then the code is called 
GC-MRP. If both the rate-1 and rate-0 nodes are relaxed, the 
code is called AC-MRP. Since all rate-1 and rate-0 nodes are 
already relaxed, the combined SSCD-RSCD will not produce 
additional complexity or latency reductions compared to RSCD 
when decoding the MRP code. Furthermore, neglecting the re¬ 
encoding complexity required by the SSCD at rate-1 nodes, 
RSCD on the modified RP codes will have the same decod¬ 
ing complexity but lower decoding latency compared to that 
of SSCD on FP codes. MRP codes also have the additional 
advantage of having lower error rates than their corresponding 
FP codes, as shown in Lemma[5] 

The complexity reductions by RSCD RP, SSCD FP, SSCD 
RP are compared in Fig.|7]for two cases: GC-RP versus SSCD 
when applied to rate-1 nodes only, denoted by SSCD(l), and 
AC-RP versus SSCD when applied to both rate-1 and rate-0 
nodes, denoted by SSCD. The complexity reductions are shown 
for a code of length 2 16 on the binary erasure channel. The FP 
codes are constructed at a rate equals to 9/10 of the channel 
capacity. To construct the RP codes, the error probability E 
of the good-channel set of the FP code is calculated and used 
to calculate the relaxation thresholds by T g = 2E/\T\, and 
% = 1 — 7/. The asymptotic bound of CR with GC-RP is 
the capacity as proved in Theorem[6] and is also shown. It can 
be noted that the results of the figure are inline with the dis¬ 
cussion above. When considering the GC-RP and rate-1 nodes 
only, RSCD GC-RP can offer higher complexity reduction than 
SSCD(l), especially at higher rates. However, after taking the 
bad-channel nodes and the rate-0 nodes into account, SSCD 
FP has higher complexity reduction than RSCD RP which 


implies that the bad channel relaxation threshold can be made 
more aggressive without degrading the performance. Across all 
rates, the combined RSCD-SSCD on RP codes has the highest 
complexity reduction. RSCD on the MRP codes is shown in 
Fig. [S]to have the least latency compared to SSCD on FP codes, 
and RSCD on RP codes. It is observed that in case of GC- 
RP, the decoding latency decreases at higher code rates due 
to the increase in the number of relaxed nodes. For AC-RP, 
the latency increases with the code rate since GC-relaxed (or 
rate-1) nodes require more latency than BC-relaxed (or rate-0) 
nodes, as described above. It has also been observed that the 
percentage of latency reduction increases with the code length. 


Decoder Complexity Reduction with Relaxed Polarization and SSCD, n =16 



Fig. 7. Complexity reduction ratios of RP codes and SSCD at different code 
rates and code length 2 16 on the BEC channel. 


Decoder Latency Reduction with Relaxed Polarization and SSCD, n=16 



Fig. 8. Latency reduction ratios of RP codes and SSCD at different code rates 
and code length 2 16 on the BEC channel. 
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Complexity Reduction & Rate at Target FER, C=0.7 



Fig. 9. Complexity reduction ratios on AWGN channel at code lengths N = 
10. and N = 14 at different code rates 


D. Performance on the AWGN channel 

The achievable complexity reduction is analyzed by actual 
construction of the relaxed polar codes on AWGN channels in 
Fig. [9] An AWGN channel with binary-input capacity C = 0.7 
is used to calculate upper and lower bounds on the bit-channel 
error probabilities. The CR at different code lengths 2 10 , and 
2 14 are logged at different target FER E. The rate, achievable 
by construction of the FP code at each target FER E, is also 
logged. It is observed that at a larger target FER E, a higher 
rate is possible, due to possible accumulation of more good- 
channel bits. The CR also increases with the target FER E, 
due to the increase of the relaxation threshold £ g , despite the 
increase in the code rate. Since the number of polarization 
levels increases with the code block length N, the achievable 
CR from RP increases with N. The effect of CR due to bad- 
channel relaxation becomes more visible, at higher target FER, 
as £), also becomes lower. 

The error-rate performance by actual relaxed successive 
cancellation decoding of the RP codes is shown in Fig. m 
for binary phase shift keying (BPSK) on an AWGN channel 
with variance o 2 as a function of the signal to noise ratio 
SNR= 101og 10 (l/cr 2 ). Practical code length of N — 4096 is 
assumed with near half-rate code of R = 0.59, respectively. 
The code is constructed with Algorithm 1, assuming an AWGN 
channel with binary-input capacity C = 0.7, and with E = 0.1. 
For simpler comparison, both the GC-RP and AC-RP codes use 
the same good bit-channel set found by construction (Stage 4) 
of the AC-RP code. However, the FP good-channel set is opti¬ 
mized for the FP polar code. It is observed that the frame error 
rate (FER) and bit error rate (BER) performances of the GC- 
RP code are similar to those of the AC-RP code. Although, the 
RP codes can have a slightly higher FER than the FP code due 
to different information sets, it is observed that the RP codes 
have lower information bit error rates than the FP codes. This 
verifies the proper design and selection of relaxation thresholds 



Fig. 10. Error performance of relaxed polar codes by BPSK on AWGN 
channel. The code length is 2 12 and the code rate is 0.59 

for the RP codes. Another important observation is that the 
proposed construction of relaxed polar codes is robust enough, 
such that it performs well over the whole range of simulated 
SNRs, although the codes are constructed for a certain SNR. 

The performance of the relaxed SCL decoder is compared 
with the performance of the regular SCL decoder for a numeri¬ 
cal example. The simulations are done for the code block length 
N = 1024, and rate R = 0.5. First, the fully polarized polar 
code of rate 0.5 is constructed for an AWGN channel at an 
SNR = 2 dB. Then, the all-channel modified relaxed polar code 
is constructed by considering the same set of information bit 
indices. The complexity reduction ratio of the modified relaxed 
polar code from the good channel relaxation only is 0.1639 and 
from the all channel relaxation is 0.3340. Regular SC decoding 
of the FP code and RSCD of the RP code are simulated and 
compared over the AWGN channel for the constructed codes, 
which corresponds to the case with list size L = 1. Further¬ 
more, the relaxed SCL decoder, as discussed in Section IVl-BI 
and the regular SCL decoder, with a maximum list size of 32 are 
simulated and compared as well. For list decoding, the polar 
information bits are concatenated with a CRC code of length 
16, where the rates are adjusted accordingly so that the actual 
information rate is 0.5, i.e., the information block length of the 
polar-CRC code is increased to 512 + 16 = 528. The simulation 
results are shown in Fig.[TT]and show about 1 dB SNR gain with 
list decoding. It is observed that with successive cancellation 
decoding, the RP code has a slightly better FER than the FP 
code. Since the RP code has the same information set as the FP 
code, this can be justified by Lemma [5] Furthermore, a better 
bit error rate (BER), with up to 0.2 dB SNR, is observed for the 
RP code compared to FP code. 

VII. Conclusion 

In this work, a new paradigm for polar codes, called relaxed 
polar coding, is investigated. In relaxed polar codes, a bit- 
channel will not be further polarized if it has been already 
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Fig. 11. Performance of relaxed successive cancellation list decoding on 
AWGN Channels, the code length is 2 10 and the polar code rate is 0.5 at L = 1 
without CRC bits. 


numerical results showed that there is room to derive tighter 
bounds. Due to the recursive calculations required to calculate 
the Bhattacharyya parameters at an arbitrary bit-channel, then 
the exact bounds are expected to be recursive in nature. For 
general BMS channels, it is more difficult to get closed form 
bounds as the polarization results in bit-channels with expo¬ 
nentially large alphabets. Another issue is that we considered 
the construction of relaxed polar codes based on Ankan’s 2x2 
polarization matrix. This construction can be readily extended 
to the general case of l x l polarization matrices, where l > 3 
cm Also, it is noted that relaxing the good bit-channels 
results in rate-1 constituent codes. These bit-channels can be 
further concatenated with other codes to further reduce the 
error probability. In fact, the interleaved concatenation scheme 
for polar codes 03 adaptively concatenates better bit-channels 
with outer codes whose rates are higher than those concatenated 
with the worse bit-channels, in order to maintain the target 
code rate or the target error performance of the concatenated 
code. When constructing concatenated relaxed polar codes, the 
adaptive concatenation scheme can be modified to take into 
account, or jointly optimize, the selection of the relaxed bit 
channels. 


polarized to be sufficiently good or sufficiently bad. Hence, 
encoding and decoding of relaxed polar (RP) codes have lower 
computational and time complexities than those of conventional 
polar codes. RP codes also have lower space complexity than 
conventional polar codes in fixed point hardware implemen¬ 
tations, due to the less number of bits required to store the 
likelihood ratios. This has the compound effect of decoder 
implementations with less power consumption. It is proved in 
this work that, similar to conventional polar codes, RP codes 
are capacity achieving. It is also shown that with proper design, 
RP codes will have lower error rates than conventional polar 
codes of the same rate. Constructions of RP codes on the binary 
erasure channel, and on general BMS channels are described. 
Asymptotic and finite-length bounds on the complexity reduc¬ 
tion achievable by relaxed polar coding are derived and verified 
for the binary erasure channel against actual constructions. The 
relaxed successive cancellation decoding (RSCD) for relaxed 
polar codes is described. The successive cancellation list de¬ 
coder for polar-CRC codes ll25l . Il26l is also modified for list¬ 
decoding of relaxed polar-CRC codes. Moreover, we discuss 
how simplified successive cancellation decoding ED, can be 
done on top of RSCD to further reduce the decoding complexity 
and latency of RP codes. For a code of rate 0.3 and length 
2 16 , the results show an 50.9% decoding complexity reduction 
ratio and an 93.5% decoding latency reduction ratio are possible 
with relaxed successive cancellation decoding of RP codes on 
the BEC. It is verified by numerical simulations on the AWGN 
channel that the information bit error rates of properly designed 
RP codes are at least as good as those of conventional polar 
code with the same rate. 

Next, we discuss possible directions for future work. 
Whereas the derived bounds on the complexity reduction ratios 
on the BEC channel have explicit closed form formulas, the 
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Appendix 

For any DMC W, let E(W) denote the probability of error 
of W under ML decoder. Let W : SC —> W be a BMS channel, 
where SC is the binary alphabet. Then 

E(W) = Y ^ mm{W(y\0), W(y\l)} 

yew 

Let the channel W be polarized using the Ankan’s Butterfly. 
Let the polarized bit-channels be denoted by W + and W~. 
Then it is known that 

W~(y 1 ,y 2 \ui) = - Y W(yi\ui ® u 2 )W(y 2 \u 2 ) (32) 

U 2 

and 

W + (y 1 ,y 2 ,u 1 \u 2 ) = © u 2 )W(y 2 \u 2 ) (33) 


Proof: Suppose that the size of output alphabet is M. 


Let ftf = 

{yi, 2/2, Vm}- Then for i = 1, 2,..., M, let 



ai = min {W (t/|0), W (y|l)} 


and 

bi = max{W(y|0), W(t/|l)} 


Then 

1 M 

E(W) = -Y*i 
* 2=1 

(36) 

and 

M M 



Y a i+Y bi = 2 

(37) 


2=1 2=1 


The size 

of output alphabet of W~ is M 2 . For any 

pair 


i.Vi,yj) G 

{ W~ ( yi , yj |0), W~ (; yi , y 3 • 11)} 

= I ~( a i a j 3" bibj), — (oj6j + aj&i) 

Notice that + bibj > aibj + ajbi. Therefore, 

1 m m i m m 

e(w~) = ~YY^ +a ^ = fJ2 Ui ^Y b ^ 

i= 1 j =1 3 = 1 j=l 

= E(W)(2 - 2E(W)) 


where the last equality follows by (l36l > and ( |37T> . This proves 
the first part of the lemma. 

The size of the output alphabet of W + is 2 M 2 . For any pair 
(yuVj) £ r 3T 2 , there are two corresponding elements in the 
output alphabet of W + . Then 

{{W + {y u y 0 ,m,W + {y uyj M^} 

,{W + ( yi , yj ,m,W + (y l: y J: l\l)}}= (38) 



Then 

1 m m i M M 

2 E(W + ) = - Y Y “‘“.J + 2 Yl Yl min Wl- a 3 b '} 

i— 1 j—1 i—1 j=1 

MM M 

= £ ai) 2 = 4 E(W ) 2 

2 = 1 j= 1 2 = 1 

which proves the second part of the lemma. ■ 


Lemma 20: For any BMS W, 

E(W~) = 2 E(W) - 2E(W) 2 (34) 


and 


E(W + ) > 2 E{W) 2 


(35) 


